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T o make sense of the information that 
agents gather from the Web, they need to 
reason about it. If the information is pre- 
cise and correct, they can use engines such as the- 
orem provers to reason logically and derive cor- 
rect conclusions. Unfortunately, the information 
is often imprecise and uncertain, which means 
they will need a probabilistic approach. 

More than 150 years ago, George Boole pre- 
sented the logic that bears his name in An Inves- 
tigation of the Laws of Thought on Which are 
Founded the Mathematical Theories of Logic and 
Probability. The title indicates the concern that 
classical logic is not sufficient to model how peo- 
ple do or should reason. Adopting a probabilistic 
approach in constructing software agents and 
multiagent systems simplifies some thorny prob- 
lems and exposes some difficult issues that you 
might overlook if you used purely logical 
approaches or (worse!) let procedural matters 
monopolize design concerns. Assessing the qual- 
ity of the information received from another agent 
is a major problem in an agent system. 

Here, we describe Bayesian networks and illus- 
trate how you can use them for information qual- 
ity assessment. 


Bayesian Networks 
A Bayesian network is a pair (G,P), where G = (V,E) 
is a directed acyclic graph and P is a multivariate 
probability distribution defined on variables that 
correspond to the nodes of G, with the property that 
each variable is probabilistically independent of its 
nondescendants given its parents. P factorizes as the 
product of conditional probabilities of each node 
given its parents, where the probability of a node 
with no parents is just the probability of the node. 
Very often an agent does not know a proposi- 
tion for certain, but still needs to decide a course 
of action. It is appropriate to model the uncer- 
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tainty using probability theory, which has a long 
history and a clear justification. For many years, 
AI researchers believed that probability theory 
was too complicated to be applied directly, so 
they invented several alternative logical and 
numerical frameworks, which almost invariably 
turned out to be approximations of probability 
theory, anyway. 

Bayesian networks found acceptance in part 
because of their ability to represent concisely a 
multivariate distribution by exploiting indepen- 
dence relations in the domain of interest. Before 
Bayesian networks were invented, a probabilistic 
modeler had to assume that variables in the 
domain were fully or conditionally independent 
to make the model tractable. This is an unrealis- 
tic assumption. 

When using a probabilistic model, the compu- 
tation of posterior probabilities (“beliefs”) on the 
presentation of evidence is most important. This 
is, for example, what would be necessary in a 
medical decision-support system, where a physi- 
cian needs to know about a disease’s probability 
in a patient with certain symptoms. The physician 
would then use the probability to assess the risk 
or expected utility of medical procedures (or fur- 
ther testing). Similarly, a decision analyst in the 
intelligence community would assess an event-of- 
interest’s probability on the basis of evidence, typ- 
ically in the form of reports. The analyst would 
then use this probability to assess the expected 
utility of actions (which may include acquiring 
further information). 


Assessing Information Quality 

Undirected cycles in Bayesian networks can serve 
to handle redundant information, such as rumors. 
For example, consider the situation in which a cat- 
astrophic event, the Chernobyl explosion, occurs 
and is reported by three different media outlets. 
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Agents on the Web 


Figure |. Dataflow graph for Chernobyl example.This graph is not a Bayesian 


network structure. 


Figure 2. Bayesian network structure for the Chernobyl example. 


Figure 3. Naive Bayes model for Chernobyl example. 


A naive observer might believe more 
strongly that an event occurred if it 
were reported in three media rather 
than just one. However, if we know 
that the information reported relies on 
a common source, such as a telephone 
interview, the report’s reliability has 
not increased at all. 

Correct processing of situations like 
this requires a nontruth-functional 
view of evidence update; in other 
words, the belief in a proposition 
depends on more than just the propo- 
sitions that directly compose it. In the 
example, our belief in the Chernobyl 
explosion depends not only on belief 
in the three media reports, but also on 
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a model of the dependencies among 
them. From the information flow per- 
spective, we could model the situation 
as in Figure 1. 

To model the uncertainty present in 
a situation, it is almost always prefer- 
able to order variables causally. In the 
Chernobyl example, this results in the 
Bayesian network of Figure 2, where 
we capture the fact that the telephone 
interview is caused by the catastroph- 
ic event at Chernobyl, while the three 
media reports are each prompted by 
that single interview. 

A qualitative analysis of the 
Bayesian network in Figure 2 indicates 
that the three media reports are not 
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(unconditionally) independent; they 
are conditionally independent when 
PhoneInterview is known; they are 
not conditionally independent when 
ThousandDead is known but PhoneIn- 
terview is unknown. These relations 
do not depend on the model’s numer- 
ical specification and can be derived 
by applying a purely graphical criteri- 
on known as d-separation.!? 

In contrast to the Bayesian net- 
work of Figure 2, in the Bayesian 
network model of Figure 3, when 
ThousandDead is known, the media 
reports become conditionally inde- 
pendent (making it a naive Bayes 
model). The model of Figure 2 can 
simulate this (and any other) feature 
of the naive Bayes model by an 
appropriate choice of numerical 
parameters, but it is impossible for 
the naive Bayes model to represent 
the situation in Figure 3, because in 
the naive Bayes model the media 
reports are necessarily (that is, 
regardless of parameter values) inde- 
pendent when the state of Thousand- 
Dead is known. 

We would therefore expect that the 
naive Bayes model overestimates the 
selectivity (belief in the event “Thou- 
sandDead” given that the reports are 
present) and the reliability (belief in 
the negation of the event “Thousand- 
Dead” given that the reports are 
absent) of the media reports. This is 
precisely what we observe when we 
compare numerically specified ver- 
sions of the naive Bayes network of 
Figure 3 with the more accurate model 
of Figure 2, which explicitly models 
the rumor (common information 
source) present in this situation. 

In Figure 4, we show the posterior 
probability of the situation in which 
all reports are heard or seen. The 
probability that thousands are in fact 
dead is much higher in the naive 
Bayes model than in the more accu- 
rate model: the naive Bayes model 
overestimates the reliability of the 
reports. In Figure 5, we show that the 
naive Bayes model also overestimates 
the selectivity of the reports. 


IEEE INTERNET COMPUTING 


Lessons 

It is sobering to realize how difficult 
it is to assess information quality in 
a large agent system. In particular, as 
soon as uncertainty enters the pic- 
ture, the mechanism used to assess 
the truth or falsity of a proposition- 
al statement no longer suffices. In 
the jargon of an AI researcher, we 
say that compositional or truth- 
functional systems are unsound. 
Simple schemes for assessing quali- 
ty of information based on the qual- 
ity of the agent relaying the infor- 
mation simply cannot work in a 
general case. Our colleagues in fuzzy 
logic might say that this is still the 
best that can be done, and we should 
concentrate on getting the best 
mileage possible from composition- 
al systems. At least in the case of 
designed systems of agents, we 
should be able to do better! 

We should not get too pessimistic, 
however. Probabilistic models can 
also help us in managing complexi- 
ty. We might expect that, to assess 
the reliability of an information 
source (say, a sensor), it would be 
necessary to cross check the state of 
that sensor against everything else in 
the whole web (oops!) of information. 
Not so! If a Bayesian network or 
influence diagram representing our 
domain is available, the only vari- 
ables we need to assess the sensor’s 
reliability are those in the sensor’s 
Markov blanket. 

When we know the states of all 
variables in the Markov blanket, the 
sensor state is not affected by other 
variables. A research team led by 
Enrique Sucar has used an approach 
based on this observation to diagnose 
sensors in power plants.? Applications 
of these ideas to agent systems have 
not yet gone beyond the early research 
prototype stage. mC 
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Figure 4.The naive Bayes model (on the right) overestimates the reliability of 
media reports. 
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Figure 5.The naive Bayes model (right in the figure) overestimates the selectivity 
of media reports. 
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